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A bank of bandpass filters is often used in performing short-time spectrum 
analysis of speech signals. This paper is concerned with the analysis and 
design of digital filter banks composed of equally spaced bandpass filters. 
It is shown that significant improvement in the composite filter bank 
response can be achieved by proper choice of the relative phases of the band- 
pass filters. The results are extended to more general filter bank configura- 
tions. 

I. INTRODUCTION' 

Many speech processing systems are based on the concept of short- 
time spectrum analysis. 1,2 Spectrum analyzers for such systems often 
consist of a set of bandpass filters Whose combined passbands cover a 
desired frequency range. Although continuous-time filters have tradi- 
tionally been used in filter banks for speech analysis, hardware realiza- 
tions of digital filters are now available, 3 and the advantages which 
digital filters offer should be exploited in filter bank design. These 
advantages include: flexibility of design of the individual bandpass 
filters, precision of realization, stability of digital hardware, and the 
efficiency of realization of the filter bank afforded by the possibility 
of multiplexing the digital hardware. Thus it is important to consider 
design techniques for filter banks composed of digital bandpass filters. 

To focus on the basic concepts in filter bank design, it is useful to 
define an ideal filter bank spectrum analyzer. Figure 1 depicts such a 
filter bank composed of digital filters whose impulse responses are 
denoted by h k (nT), k = 0, 1, • • • , M, where l/T is the sampling 
frequency of the input signal.* Such a filter bank constitutes an ideal 
spectrum analyzer if the input x(nT) (with possibly further band limit- 
ing) can be synthesized exactly (within some fixed delay) by a linear 

* The filter ho(nT) is a lowpass filter which is included for completeness although 
this band is usually not analyzed in practical speech analysis systems. 
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Fig. 1 — Bank of digital bandpass filters. 



combination of the bandpass filter outputs x k (nT). An example of 
such a system would be one in which the filters are ideal rectangular 
bandpass filters with the same constant gain and linear phase in their 
passbands and zero gain outside. If the filter bandwidths are such 
that the frequency range 

is completely covered without overlap, then the input can be synthesized 
exactly by adding together the outputs of the bandpass filters. 

The essential characteristics of the ideal spectrum analyzer are that 
the frequency response of the combined outputs must exhibit a flat 
magnitude response and a linear phase, and therefore the combined 
impulse response must be a delayed digital impulse (unit sample). 
Causal digital filters (filters whose impulse responses are zero for 
n < 0) cannot have the desired ideal gain characteristics and may 
not have linear phase.* Therefore a filter bank composed of such filters 
cannot achieve the ideal characteristics of flat magnitude response and 
linear phase. In this paper we will describe an approach to the design 
of filter banks that approximate the ideal spectrum analyzer. First 
we present a detailed analysis of a filter bank configuration in which 
equally spaced, equal-bandwidth digital filters are used. This analysis 
suggests a technique for optimizing the filter bank characteristics and 

* Finite duration impulse response digital filters can have precisely linear phase. 
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also suggests how the results can be used where nonuniform bandwidths 
are desired. The results are illustrated with examples. 

II. ANALYSIS OF UNIFORM FILTER BANKS 

Assume that the bandpass filters in Fig. 1 have impulse responses 
of the form 

h k (nT) = 2 | D k | h(nT) eos (a> k nT + <S>*) k = 1,2, ■■■ ,M 

h (nT) = | D | h{nT) 

where w k = g>i + (fc — 1) Aco, and h{nT) is the impulse response of a 
prototype lowpass filter. (When the fc = lowpass filter is used, «i = Aa> 
and coo = 0.) The system functions for this set of bandpass filters are 

H k (z) = | D k | e i * t H(ze- ia ' T ) + | D k \ e'^Hize^^) k - 1, 2, ■ ■ • ,M 

H (z)= \D \H(z). (2) 

The frequency response of these filters is obtained after substituting 
z = e 1 u T in (2) as 

H k (e iaT ) = | D k \e i * k H{e iia - a » )T ) + | D k \ e - i * t H(e Hu + u " )T ), 

k = 1,2, ... ,M, (3) 

# (O = I D„ \H{e iuT ). 

If the frequency response of prototype lowpass filter, H(e' uT ), drops 
off sharply, then it can be seen from (3) that 

| H k (e iuT ) \&\D k \\ H(e i{a -" k)T ) \ ^ a ^ w/T 

(V\D k \\ H(e i{w + Uk)T ) | - t/T ^ co :g 0. 

In this case the filter bank consists of a set of (M + 1) equally spaced 
bandpass filters with identical magnitude responses around their 
respective center frequencies. We have chosen this method of designing 
bandpass filters from lowpass prototypes for analytical convenience 
and because of the importance of spectrum analysis systems of this 
form. 1,2 The results to be discussed apply for other bandpass trans- 
formations in so far as they yield a set of uniformly spaced bandpass 
filters with identical frequency characteristics. 

Our objective is to choose the prototype lowpass filter and the 
parameters \ D k \, coi , Aw, and * t so that the filter bank will closely 
approximate the characteristics of the ideal spectrum analyzer. To do 
this we must consider the response of the composite filter bank. First, 



(5) 



(6) 
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however, it is useful to interpret the individual bandpass filter outputs 
in terms of spectrum analysis considerations. 
The individual filter outputs are of the form 

x k (nT)= E MrT) \ D k \ h(nT - rT) cos [u> k (nT - rT) + SJ, (4) 

x Q {nT) = 2 x(rT) \ D \ h(nT - rT) 

which can be expressed as 

x k (nT) = 2 Re \D k X(w k , n)e iUknT ] 
x (nT) = Re {D X(0,n)\ 
where D k is the complex constant defined by 

D k = \D k \e iik k = 1, 2, ■•• ,M 
D = | Do \, 
and 

X(a, k , n) = £ x(rT)h(nT - rT)e i( ° kTT . (7) 

r- — oo 

The quantity X(u k , n) is the discrete-time version of the short-time 
Fourier transform 4 of x(nT). Thus, (5) serves to relate the bandpass 
filter outputs x k (nT) to the short-time Fourier transform. 

The frequency response and impulse response of the composite filter 
bank are obtained from 

M 

v(nT) = I>*(nT). (8) 

A- = 

After substituting (5) into (8) and noting that if x(nT) is real, X(- w k , n) 
is the complex conjugate of X{o) k , n), we obtain 

y(nT) = £ D k X(u, k ,n)e iu * nT (9) 

k — M 

where w_* = — w k , and D- k is the complex conjugate of D k . Substituting 
(7) into (9) and interchanging the order of summations results in 

y(nT) = E x(rT)\h(nT - rT)- £ Dj** (nT - rT) • (10) 

Defining 
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d(nT) = £ ZV tnT , (11) 

k = -.\t 

we observe from (10) that the combined impulse response of the filter 
bank can be expressed as 

Ii(nT) = h(nT) d{nT). (12) 

Equations (11) and (12) are the basic results of the analysis. (Note 
that they could have been obtained directly by summing the impulse 
responses h k (nT), with the sacrifice of the interpretation of the filter 
bank outputs in terms of the short-time spectrum.) 

Equation (12) shows that the filter bank impulse response is the 
product of the prototype lowpass filter impulse response h(nT), and 
the sequence d(nT) defined by (11). The choice of lowpass filter depends 
on both the desired frequency resolution and the requirement of obtain- 
ing flat magnitude and linear phase response in the composite filter 
bank. The sequence d(nT) Is independent of the prototype lowpass filter 
and is a function of the frequency spacing, the relative gains and phases, 
and the number of bandpass filters. Thus, for a given choice of prototype 
lowpass filter, the parameters of d(nT) can be adjusted to achieve the 
best approximation to the ideal spectrum analyzer. To see how this 
occurs, we shall first examine in detail the characteristics of the sequence 
d{nT). 

As will be shown in the remainder of this section, a particularly useful 
choice of the complex coefficients D k in (11) is 

D k =e ia *" T k = ±1, ±2, ••• ,±M. 
D = 1, 
where n is an integer. That is, 

$ A . = w k n T, and \ D k \ = 1. 

(The condition D = 1 implies that the band around w = is included 
in the filter bank; D Q = implies that it is not.) It can be shown that 
for this choice of D k , (11) becomes 

where co„ = w, + (71/ — 1) Aw/2, and D is 1 or depending on whether 
or not the lowpass channel is included. 
The properties of the sequence d(nT) determine the character of 



3102 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1971 

the impulse response of the filter bank. Some of these properties are 
summarized below: 

(i) The parameter n shifts the sequence d(n T) by n samples with 

respect to h(nT). 
(ii) The sequence d(nT) is even about the sample n = —n ; i.e., 

d{nT - n a T) = d(-nT - n T). 
(Hi) The maximum value of d{nT) occurs at n = —n . This value 

is d(-n T) = 2M + D . 
(iv) If 27r/(Aco7 1 ) and a^/Aw are both integers, then the sequence 
d(nT) is periodic with period 2ir/Au. Otherwise, d(nT) will be 
an almost periodic sequence which will peak up at time intervals 
of 27r/Ao>. 

Insight into the properties of d(nT) can be gained by considering a 
simple example. Assume that a>, = Aw and 2t/(Aq)T) = N where N is 
an odd integer. That is, the entire frequency range —ir/T ^ u ^ w/ T, 
is divided into N equal bands. If M = (N — l)/2, the entire frequency 
range is covered. Under these conditions (14) can be written 



sin 



2M " h ' ' Aa,(nT + n T) 



d(nT) = - L \ '{ . ' , t^t-, - 1 if D = (15a) 

v ' sin [Ao)(nT + n T)/2] 



sin 



(m±i) MnT + neT) ] 



sin [Aa>(nT + n T)/2] 



if Do = 1. (15b) 



(If N is even, the k = filter is not used, i.e., D = 0, and co, is Ato/2.) 
It is clear from (15b) that for these conditions d(nT) is a periodic 
sequence with period NT = 2tt/ Ao>. In fact, d(nT) may be thought of as 
samples of a continuous-time periodic Dirichlet kernel as shown in 
Fig. 2a. If M = (N — l)/2 and D = 1, d(nT) is a periodic discrete- 
time impulse train with impulses occurring at multiples of NT. This 
is because the sample points on the periodic Dirichlet kernel occur at 
the maxima and the zero crossings, as indicated by the small circles in 
Fig. 2a. 

The conditions for d(nT) to be periodic are that both wj/Aw and 
27^/(Aw7 , ) be equal to integers. To see this, we must examine (14) in 
detail. If 2ir/(AuiT) is an integer, and M is an odd integer, the sequence 
2 sin [M Au(nT + n T)/2]/sin [Aoj(nT + n„T)/2] is periodic with 
period NT = 2t/Au>. If wi/Aw is an integer and M is odd, the sequence 
cos [(cj, + (M — 1) Au)/2)(nT + n T)] is also periodic with a period 
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Fig. 2 — (a) Periodic continuous-time Dirichlet kernel, (b) continuous-time en- 
velope and sequence d{nT) when either wi/Aoj or 2v/(AaT) are not integers. 



that is an integer multiple of 27r/Aa>. Thus the product of these two 
sequences is periodic with period 27r/Ao>. The identical result holds 
for M an even integer and 27r/Ao> and wi/Aw integers although the 
interaction between the component sequences is slightly different. 

If either 2ir/(AuT) or co,/Aw are not integers, d(nT) will not be 
periodic, but will still peak up at time intervals of 2tt/Au. Such a case 
is depicted in Fig. 2b where the samples d(nT) are marked by the 
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small circles and the dotted curve shows the factor 

sin [M Aa>(nT + n T)/2]/sm [Ao>(nT + n T)/2] 

when M is odd. As shown in Fig. 2b, d(nT) will always have even sym- 
metry about sample n = —n . 

III. DESIGN OF UNIFORM FILTER BANKS USING BESSEL FILTERS 

In the preceding section we presented a detailed analysis of a filter 
bank composed of equally spaced equal-bandwidth filters. In this 
section we will show how the results of that analysis can be employed 
in filter bank design. 

The objective of flat amplitude response and linear phase is most 
easily achieved with bandpass filters having these same properties. 
For this reason, Bessel (maximally flat delay) filters are often used in 
filter banks. 5 In the examples shown in this paper, we have used digital 
filters obtained from Bessel prototype designs using impulse invariance. 8 
It should be noted that the digital filters obtained this way do not have 
the maximally flat delay property. J. P. Thiran 7 has shown that the de- 
nominator of the system function of maximally flat delay digital filters 
is a Gauss hypergeometric function. It is reasonable to expect however, 
that for the narrow-band filters of interest here, the differences should 
be negligible. 

As an example a digital filter derived from a sixth-order Bessel 
lowpass filter with asymptotic cutoff frequency of 60 Hz is shown 
in Fig. 3. The impulse response is shown in Fig. 3a, and the amplitude 
and phase responses are shown in Fig. 3b and Fig. 3c. The filter shown in 
Fig. 3 was used in a filter bank* with the following choice of parameters: 
D = 0, T = 10" 4 sec, Aco = 2tt(100), w, = 2tt(100), n = 0, and 
M = 30. The resulting filter bank characteristics are shown in Fig. 4. 
The filter bank impulse response, h(nT), is shown in Fig. 4a along 
with the prototype lowpass impulse response h(nT). For the above 
choice of parameters, d{nT) is obtained from (15a) as 

j/ m\ s in [0.6l7rw] t / 1A >. 

d ( nT ) = • rn ai — i - *> ( 16 

sin [O-Obrw] 

which is periodic with period 100 samples (10 msec), with peaks occurring 
at nT = 0, ±10, ±20, • • • msec. From Fig. 4a, it can be seen that 
in the product h{nT) ■ d{nT) , the peak of d{nT) at nT = will be 
attenuated since h(nT) is small around nT = 0. On the other hand, 



Note that the resulting bandpass filters are twelfth order. 
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Fig. 3 — Sixth-order Bessel filter characteristics, (a) Impulse response, (b) mag- 
nitude response, (c) phase response. 
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Fig. 4 — Characteristics of 30-channel filter bank, (a) Impulse response (dotted 
curve is the impulse response of the prototype lowpass filter in Fig. 3), (b) composite 
magnitude response, (c) composite phase response after subtracting 10-msec delay. 
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the peak of d(nT) at nT = 10 msec occurs at approximately the peak 
of h(nT), and at nT = 20 msec, h(nT) is large enough to produce a 
significant echo in the impulse response of the filter bank. As is shown 
in Fig. 4b and 4c, this corresponds to a 3.9-dB ripple in the amplitude 
response and a 25.5-degree peak-to-peak ripple in the phase response 
(after removing a linear phase component corresponding to a 10-msec 
or 100-sample delay). To decrease this amplitude and phase ripple, we 
should attempt to eliminate the echo in the impulse response. Further- 
more, the phase ripple will be eliminated if the impulse response h(nT) 
has even symmetry about some delay time n D T. One approach is to 
broaden the filter bandwidths, or equivalently reduce the spacing Aa>, 
so that h(nT) is contracted relative to the spacing of pulses in d(nT). 
This is generally not an acceptable solution since hinT) and Aw are 
usually fixed by some frequency resolution criterion. However, if we 
refer to the properties of d(nT) which were previously summarized, 
we note that a negative value of n will shift d(nT) to the right relative 
to h(nT) so that d(nT) will have even symmetry about time n D T = 
—n T. If n can be chosen so than h(nT) = h(nT)-d{nT) has approxi- 
mately even symmetry and consists of only one significant pulse, then 
the amplitude and phase ripple will be small. The manner in which this 
is achieved is shown in Fig. 5 where it is assumed for simplicity that 
d(nT) is a train of digital impulses as would be the case for M = (N — 
l)/2. Figure 5a depicts the case where n = 0. Figure 5b shows the 
situation where n was chosen to shift the impulse which was at n T = 
in Fig. 5a to the right and into the vicinity of the peak of h(nT). If it 
is assumed that only three impulses have nonzero amplitudes (a l , 
a 2 , a 3 ) such that 4 | a, | • | a : , | < | a x + a 3 | ■ | a 2 |, then it can be shown 
(see Appendix) that the peak-to-peak amplitude ripple of the filter 
bank is 

«, = '20 1og,„[ |°' + ° l+a3 | l- (17) 

\_\a 2 — <*i — a 3 I J 

Similarly, if | a, + a 3 | < \a 2 \, the peak-to-peak phase ripple about 
a linear phase corresponding to a delay of —n T is given by 

R P = 2 tan -1 7-5 7 — , 3 -. 2 <.i/2 • (18) 

L(a 2 — (<*i + oc 3 ) ) J 

The conditions for (17) and (18) to hold are satisfied when a, and a 3 
are small relative to a 2 , which is the normal situation. It can be seen 
from (18) and (17) that the phase ripple will be zero if «i = a* , and 
the amplitude ripple will be small if (a, + a 3 )/a 2 is small. 

Although these results were derived for the idealized case when d(nT) 
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Fig. 5 — Illustration of how to adjust the parameter no. (a) Composite impulse 
response for n = 0, (b) n chosen to minimize magnitude and phase ripple (dotted 
lines indicate movement of individual pulses in d(nT)). 



is an impulse train, we have found that amplitude and phase ripple 
can be determined quite accurately using (17) and (18) in more general 
situations. With the foregoing principles in mind we have written an 
interactive computer program for filter bank design. Using this program 
we can design a filter bank with low amplitude and phase ripple by 
the following process: 

(i) Choose o>, , Aw, and M to cover the desired analysis band and 

choose h(nT) to provide desired frequency resolution. This 

results in an h{nT) that has a duration of approximately 47r/Au 

as shown in Fig. 5. 

(ii) Evaluate h(nT) and determine n such that a, « a 3 as in 

Fig. 5b. 
(in) If the resulting filter bank is not satisfactory, steps i and ii 
are repeated. 

In cases where wj/Aco is not an integer, it is important to choose n 
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so that the point of even symmetry in d(nT) is shifted into the vicinity 
of the peak of h()iT). Otherwise, it may be impossible to achieve a very 
good approximation to linear phase. An example of the improvement 
gained by proper choice of n is shown in Fig. 6. In this example all 
the parameters were the same as in the example of Fig. 4 except a value 
of >/ = — 129 was chosen by the above process. In this case the in-band 
amplitude ripple is 0.S dB and the phase ripple is 0.6 degree, as compared 
to 3.S dB and 25.5 degrees when ?(„ = 0. 

R. M. Golden 5 has shown that inverting the sign of alternating chan- 
nels often significantly improves the characteristics of a filter bank. This 
technique has a simple interpretation in terms of our results. It can be 
shown that inverting the sign of alternating channels is equivalent to 
delaying the sequence diriT) by n = — w/(AuT) samples. This amount 
of delay may be nearly correct if the duration of h(nT) is approxi- 
mately 37r/Aco; however for the situation shown in Fig. 4a, such a delay 
would produce a worse filter bank than no delay at all (n„ = 0). Also, 
to achieve linear phase when o>,/Aco is not an integer, the point of even 
symmetry in d(nT) should be delayed to the vicinity of the peak of 
fi(nT). This does not occur when the signs of alternate channels are 
inverted. 

IV. DESIGN" OF NONUNIFORM BANDWIDTH FILTER BANKS 

In speech applications it is common to take advantage of the fre- 
quency resolution characteristics of the ear 4 ' 5 by using increasing 
bandwidth filters at higher frequencies. The previously discussed tech- 
niques can be applied to this situation if the filter bank consists of 
several sub-banks, each with different resolution. Each sub-bank can 
be designed as discussed above, with care being taken to ensure that 
the entire frequency band of interest is covered by the combination 
of the sub-banks. It may be necessary to equalize the delay between 
sub-banks by providing additional delay for all but one of the sub- 
banks.* This is depicted in Fig. 7 for three sub-banks with increasing- 
bandwidth sixth-order Bessel filters. Figure 7a shows the lowpass 
prototype impulse response and shifted d(nT) sequence t for the first 
sub-bank. The lowpass asymptotic cutoff used was 78 Hz, the spacing 
of filters was Aw, = 2tt(125), the first filter was centered at w n = 
2ir(250), and a value of n 0i = —100 (10-msec delay) was required to 

* Golden 5 has shown that the delays can be approximate!}' equalized by increasing 
the order of the lowpass prototype in direct proportion to the increase in bandwidth. 

f The sequence d(nT) is shown as an impulse train for convenience in plotting. 
The actual sequences would look like those in Fig. 4 and Fig. 6. 
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Fig. 6 — Characteristics of 30-channel filter bank, (a) Impulse response for n = 
— 129 (dotted curve is the impulse response in Fig. 3), (b) composite magnitude 
response, (c) composite phase response after subtracting 12.9-msec delay. 
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minimize the amplitude and phase ripple. Figure 7b shows the second 
sub-bank in which the basic parameters were: lowpass asymptotic 
cutoff 136 Hz, Aco 2 - 2tt(218), a> 12 = 2tt(1296.5), and n 02 = -57 
(5.7-msec delay). To line up the central peaks, an additional delay 
of n 2 = 43 samples (4.3 msec) was required. Figure 7c shows the third 
sub-bank where the lowpass cutoff was 192 Hz, Aw 3 = 27r(307), a>, 3 = 
2ir(2213), and n 03 = — 40 (4-msec delay). A value of n 3 = 60 samples 
(6.0 msec) is required to line up the central peak with those in Fig. 7a 
and 7b. The response of the combination of these three sub-banks is 
shown in Fig. 8. Figure 8a shows the impulse response, Fig. 8b shows 
the amplitude response, and Fig. 8c shows the phase after a linear 
phase corresponding to 10-msec delay has been subtracted. It can be 
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nT(msec) 

Fig. 7 — Illustration of the design of nonuniform filter banks: (a) impulse response 
for narrow bandwidth filters, (b) impulse response for intermediate bandwidth filters, 
(c) impulse response for wide bandwidth filters. 
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Fig. 8 — Composite filter bank characteristics with three different aub-banks : (a) 
impulse response, (b) magnitude response, (c) phase response after subtracting 
10-msec delay. 
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seen in Fig. 8b and Sc that the ripple in the sub-banks is quite low 
as would be expected from Fig. 7. At the boundary between sub-banks, 
however, the ripple increases significantly due to the fact that the last 
filter in the lower sub-bank drops off more rapidly than the first filter in 
the next sub-bank. This excessive variation at the boundary between 
sub-banks can be eliminated to some extent by using increasingly 
higher-order filters in the sub-banks. Alternatively, nonuniform resolu- 
tion can be obtained by using equal-bandwidth filters and adding 
together groups of two or more of their outputs to achieve the desired 
bandwidth. Such an approach would require increased computation 
but would produce filter bank characteristics comparable to those in 
Fig. 6. 

V. CONCLUSION 

We have discussed the analysis and design of digital filter banks 
and have shown how the incorporation of a linearly increasing phase 
shift in each bandpass filter can significantly improve the overall filter 
bank characteristics. We also showed how the techniques can be used in 
nonuniform bandwidth filter banks. 

The examples which we gave were based on Bessel lowpass prototypes 
which have impulse responses of desirable shape but rather poor am- 
plitude response. Recent results in the design of finite duration impulse 
response filters 8 offer attractive possibilities for filter bank design. 
Such filters can have precisely linear phase and can be designed using 
iterative techniques with constraints on both the impulse response 
shape and the amplitude response. The use of such filters, together with 
the basic principles discussed in this paper, should yield filter banks 
with excellent properties. 

APPENDIX 

Derivation of Magnitude and Phase Ripple Formulas 
Assume an impulse response sequence 

h(n) = a, n = 
= a 2 n = n p 
= a 3 n = 2n p 

= elsewhere. (19) 

The system function of this system is 

H(e imT ) - a, + a 2 e- iun " T + a* '"*'*. (20) 
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The squared magnitude response is 

I H(e iuT ) | 2 = [a a + (a, + a 3 ) cos (con p T)] 2 + (a, - a 3 ) 2 sin 2 (con.T), 

(21) 
and the phase response is 

arg [H(e'» T )] = tan" 1 [ £\~ * U ™ ^ „ ] < 22 > 

L« 2 + (a, + a s ) cos (wn„T) J 

where a linear phase component —u)7i p T has been removed. Clearly, 
both (21) and (22) are periodic functions of u with period 2ir/n„T. 
To determine the amplitude and phase ripple, we must locate the 
maxima and minima of (21) and (22). 

If we differentiate (21) with respect to to, we find that the maxima 
and minima occur for values of oj satisfying 

sin (om p T) = (23a) 

cos (om p T) = -a 2 faL+gd . ( 23 b) 

The second equation is satisfied by a real value of co if and only if 

4 | a, | • | a 3 | > | «i + a 2 | ■ | a 2 |. (24) 

In a good filter bank design, a, and a 8 will be positive and much smaller 
than a 2 , and (24) will not be satisfied. Evaluating the second derivative 
shows that in this case the maxima and minima of \ H(e"° T ) | will 
alternate and occur at values of to satisfying (23a); i.e., to = 0, ±ir/n p T, 
±2ir/n p T, ■ ■ • .In this case the amplitude ripple in dB is given by 



B, = 201og, o r| a » + a ' +a »j 1. 

\_\a 2 — a, — a 3 J 



(25) 



If (22) is differentiated with respect to to, we find that the maxima 
and minima occur at values of u satisfying 



T= _(^±a 1 \ (26) 



cos ton 

a 2 

Equation (26) is satisfied by real values of a if | a x + a* \ < | a 2 |. 
In this case the maxima and minima again alternate, and the peak- 
to-peak phase ripple is 

R p = 2 tan" 1 F7- 2 3 ~V* ^, J ■ (27) 

_(a 2 — fa, + a 3 ) ) J 
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If | cti + a a | > | a* |, the phase curve will be discontinuous with a 
jump of 2ir radians occurring at to = iir/n^T, ±3ir/n„T, ■ ■ • . 
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The Preference of Slope Overload to 
Granularity in the Delta Modulation of 

Speech 

By N. S. JAYANT and A. E. ROSENBERG 

(Manuscript received June 18, 1971) 

A preference study was made to assess the relative annoyance values 
of slope-overload distortion and granular noise in delta-modulated speech. 
A recently described adaptive delta modulator was simulated at frequencies 
of 20 and 40 kHz, and controlled amounts of the two types of degradation 
were introduced into samples of a 2-second utterance. Rankings were 
obtained for these samples on the basis of preference judgments of nine 
listeners, each of whom assessed the samples, pairwise, in a tournament- 
type strategy. Results indicate that the speech sample exhibiting the mini- 
mum degradation on an objective, overall-noise-power basis is not subjectively 
the most preferred sample. Furthermore, the subjectively optimum delta 
modulator exhibits greater overload and lesser granularity than the ob- 
jectively optimum device. 

I. INTRODUCTION 

The principle of delta modulation 1 has been widely described in the 
literature. Briefly, delta modulation is a digital encoding strategy 
which uses a simple feedback mechanism to produce a "staircase" 
approximation to an input signal. A block diagram of the simplest form 
of delta modulation appears in Fig. 1. The input sequence {A' r } is 
usually band-limited and suitably oversampled. The "staircase" se- 
quence Y r is generated according to the equations 

C T - sgn (A\ - I' r _,) (1) 

Y r - F r _, = m r = A r C r . (2) 

The step-size A r is assumed to be a constant in conventional (linear) 
delta modulation. "Adaptive" delta modulation, on the other hand, 
allows for modifications of A r in accordance with the changing slope 
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Fig. 1 — Schematic diagram of a linear delta modulator. 

characteristics of the input signal. Such adaptation results in better 
encoding, and several types of adaptive delta modulation have been 
described in the literature. 2,3,4 

Figure 2 illustrates the mechanism of an adaptive delta modulator 
and demonstrates how suitable increases and decreases of step size 
facilitate better encoding during steep and flat regions of the input 
signal waveform. Such adaptations can be effected by observations on a 
"recent" segment of the binary sequence \C T ); this is illustrated by 
equation (5) in the sequel. 

Figure 2 also brings out the distinction between two types of encoding 
error in delta modulation, viz., "granular noise" and "slope-overload" 



"HUNTING" OR "GRANULAR" 
NOISE 



"SLOPE-OVERLOAD" 
DISTORTION 




Y r - Y r _, = m r 
Xp- Xp_ j == Sf 

Xt-Y t -E tc . Xt-rYt-^Eft^),, 

sgn [x r -Y r _ 1 ] = C r =sgn [m r ] 

C r = TRANSMITTED "CHANNEL" SYMBOL 



Fig. 2 — Illustration of adaptive delta modulation. 
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distortion. A given error sample 

E r = X r - Y T (3) 

can be denned to fall into the granular or slope-overload category, 
depending on whether the corresponding step m r crosses the input 
waveform or not. Thus, in Fig. 2, there is a 'granular' error E, a at the 
sampling instant t, and an 'overload' error 2? {1 _ 1)0 at the sampling 
instant (i - 1). As a matter of definition, we will note that E t0 = 

E it -uo = 0. 

The signal output { Z r } of the delta modulator is actually obtained by 
filtering the staircase sequence { Y r \ to the input signal band. Let 
[X F r \ be the result of passing {X r } through the same lowpass filter. A 
perceptually relevant measure of signal degradation is accordingly 
defined by the encoding error 

e r = X p r - Z T . (4) 

As with the quantity E T in (3) , one can distinguish samples of granularity 
and slope overload, e rG and e r0 , in the error sequence \e T \. Referring 
to Fig. 2 once more it can be seen that a physical distinction between 
the two types of error is suggested. Granularity can be described as a 
"signal-uncorrelated" random noise-type of phenomenon. It is char- 
acterized by alternation of signs and tends to be independent of signal 
amplitude. Slope overload, on the other hand, can be described as a 
"signal-correlated" distortion, since its sign and magnitude are related 
to the slope of the signal. This physical difference between slope overload 
and granularity suggests a corresponding perceptual distinction and 
raises the question of the relative annoyance values of the two forms 
of signal degradation in delta modulation. The present paper describes 
a study of the above question as referred to the delta modulation of a 
speech signal. 

Earlier work in this subject is in the form of a perceptual experiment 5 
in which H. Levitt, et al., characterized the perceptibility of slope- 
overload distortion as such. As mentioned earlier, our paper will seek 
to answer the complementary question of the relative perceptibilities 
of slope overload and granularity when they occur simultaneously in 
delta-modulated speech, as they usually do. 

The approach we used was to vary the relative amount of slope 
overload and granularity introduced into samples of a test utterance, 
and to evaluate these samples on the basis of both objective and per- 
ceptual criteria; and then to interpret these evaluations with specific 
reference to the overload-granularity dichotomy. 
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Section II summarizes the salient features of a computer-simulated 
adaptive delta modulator that was utilized in the present study. This 
adaptive encoder has been recently described and shown to provide 
toll-quality speech reproduction at bit rates of practical importance. 4 

Section III defines the objective measures of speech quality used in 
our study, while Section IV defines a subjective measure of preference 
and describes an underlying perceptual experiment. 



II. DESCRIPTION OF THE DELTA MODULATOR 

Figure 3 is a schematic block diagram of the adaptive delta modulator 
utilized in the present study. This encoder is defined by the basic 
equations (1) and (2), and by the adaptation rule 



A r = P-A r _! if C r = C r -y 



= ^-A r _, if C T * C r -i 



P > 1. 



(5) 



Notice that a conventional (linear) delta modulator corresponds to 
the special case of P = 1. In our study the value of P was a variable 
parameter; different (delta-modulated) speech samples corresponded to 
different suitably spaced values of P, and thereby to different mixtures 
of slope-overload and granularity. 

The original speech sample X was a 2-second male utterance of "Have 
you seen Bill?" that had been band-limited to 3.3 kHz. The delta 
modulation was performed at sampling rates of 20 and 40 kHz. The 
latter frequency provides speech reproduction that approaches telephone 
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Fig. 3 — Schematic diagram of an adaptive delta modulator. 
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Design of Digital Filter Banks for Speech 

Analysis 

By R. W. SCHAFER and L. R. RABINER 

(Manuscript received June 22, 1971 ) 

A bank of bandpass filters is often used in performing short-time spectrum 
analysis of speech signals. This paper is concerned with the analysis and 
design of digital filter banks composed of equally spaced bandpass filters. 
It is shown that significant improvement in the composite filter bank 
response can be achieved by proper choice of the relative phases of the band- 
pass filters. The results are extended to more general filter bank configura- 
tions. 

I. INTRODUCTION 

Many speech processing systems are based on the concept of short- 
time spectrum analysis. 1,2 Spectrum analyzers for such systems often 
consist of a set of bandpass niters Whose combined passbands cover a 
desired frequency range. Although continuous-time filters have tradi- 
tionally been used in filter banks for speech analysis, hardware realiza- 
tions of digital filters are now available, 3 and the advantages which 
digital filters offer should be exploited in filter bank design. These 
advantages include: flexibility of design of the individual bandpass 
filters, precision of realization, stability of digital hardware, and the 
efficiency of realization of the filter bank afforded by the possibility 
of multiplexing the digital hardware. Thus it is important to consider 
design techniques for filter banks composed of digital bandpass filters. 

To focus on the basic concepts in filter bank design, it is useful to 
define an ideal filter bank spectrum analyzer. Figure 1 depicts such a 
filter bank composed of digital filters whose impulse responses are 
denoted by h k (nT), k = 0, 1, • • • , M, where 1/T is the sampling 
frequency of the input signal.* Such a filter bank constitutes an ideal 
spectrum analyzer if the input x(nT) (with possibly further band limit- 
ing) can be synthesized exactly (within some fixed delay) by a linear 

* The filter ho(nT) is a lowpass filter which is included for completeness although 
this band is usually not analyzed in practical speech analysis systems. 
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Fig. 1 — Bank of digital bandpass filters. 



combination of the bandpass filter outputs x k (nT). An example of 
such a system would be one in which the filters are ideal rectangular 
bandpass filters with the same constant gain and linear phase in their 
passbands and zero gain outside. If the filter bandwidths are such 
that the frequency range 

— <« <- 

T ~ = T 

is completely covered without overlap, then the input can be synthesized 
exactly by adding together the outputs of the bandpass filters. 

The essential characteristics of the ideal spectrum analyzer are that 
the frequency response of the combined outputs must exhibit a flat 
magnitude response and a linear phase, and therefore the combined 
impulse response must be a delayed digital impulse (unit sample). 
Causal digital filters (filters whose impulse responses are zero for 
n < 0) cannot have the desired ideal gain characteristics and may 
not have linear phase.* Therefore a filter bank composed of such filters 
cannot achieve the ideal characteristics of flat magnitude response and 
linear phase. In this paper we will describe an approach to the design 
of filter banks that approximate the ideal spectrum analyzer. First 
we present a detailed analysis of a filter bank configuration in which 
equally spaced, equal-bandwidth digital filters are used. This analysis 
suggests a technique for optimizing the filter bank characteristics and 

* Finite duration impulse response digital niters can have precisely linear phase. 
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also suggests how the results can be used where nonuniform bandwidths 
are desired. The results are illustrated with examples. 

II. ANALYSIS OF UNIFORM FILTER BANKS 

Assume that the bandpass filters in Fig. 1 have impulse responses 
of the form 

h k (nT) = 2 | D k | h(nT) cos (<* k nT + $*) k = 1,2, ■■■ ,M 

h (nT) = | D | KnT) 

where o> t = co, + (k — 1) Aco, and h(nT) is the impulse response of a 
prototype lowpass filter. (When the k = lowpass filter is used, a>i = Aw 
and wo = 0.) The system functions for this set of bandpass filters are 

H k (z) = | D k | e i * k H(ze~ i " kT ) + | D t \ e~ i * t H(ze iUkT ) k = 1, 2, ■ ■ • ,M 

H (z)=\D \H(z). (2) 

The frequency response of these filters is obtained after substituting 
z = e' uT in (2) as 

H k {e iuT ) = | D k | e i *"H(e Ha - Ut)T ) + \ D k \ e -'* 4 ff( e '" ( " +w * )r ), 

fc= 1, 2, ••■ ,M, (3) 

#o(e' uT ) = \D \H(e iuT ). 

If the frequency response of prototype lowpass filter, H(e' uT ), drops 
off sharply, then it can be seen from (3) that 

| H k (e iuT ) | « | D k | | ^(e' (u - u ' ,r ) | ^ a. g tt/7' 

« | D k | | H(e Hu+Uk)T ) | - tt/7 7 ^ « ^ 0. 

In this case the filter bank consists of a set of (M +1) equally spaced 
bandpass filters with identical magnitude responses around their 
respective center frequencies. We have chosen this method of designing 
bandpass filters from lowpass prototypes for analytical convenience 
and because of the importance of spectrum analysis systems of this 
form. 1-2 The results to be discussed apply for other bandpass trans- 
formations in so far as they yield a set of uniformly spaced bandpass 
filters with identical frequency characteristics. 

Our objective is to choose the prototype lowpass filter and the 
parameters \ D k \, o>i , Au>, and $* so that the filter bank will closely 
approximate the characteristics of the ideal spectrum analyzer. To do 
this we must consider the response of the composite filter bank. First, 



3100 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1971 

however, it is useful to interpret the individual bandpass filter outputs 
in terms of spectrum analysis considerations. 
The individual filter outputs are of the form 

x k (nT)= 2 2x(rT) \ D k \ h(nT - rT) cos [a k (nT - rT) + * k ], (4) 
XoinT) = 2 *bT) | D | h(nT - rT) 



(5) 



(6) 



which can be expressed as 

x k (nT) = 2 Re {D k X(u k , n)e i " k " T ] 

x (nT) = Re {D X(0,n)} 
where D k is the complex constant defined by 

D k = | D k \e i * k k = 1, 2, ••• ,M 

Do = | Do I, 

and 

X(cu* , n) = £ x(rT)h(nT - rT)e io " rT . (7) 

r = -oo 

The quantity X(w* , n) is the discrete-time version of the short-time 
Fourier transform 4 of x(nT). Thus, (5) serves to relate the bandpass 
filter outputs x k (nT) to the short-time Fourier transform. 

The frequency response and impulse response of the composite filter 
bank are obtained from 

y(nT) = Jtx k (nT). (8) 

k = U 

After substituting (5) into (8) and noting that if x(nT) is real, X( — u kt n) 
is the complex conjugate of X(a k , n), we obtain 

y(nT) = E D k X(a> k ,n)e iatnT (9) 

k — M 

where co_t = — u> k , and D- k is the complex conjugate of D k . Substituting 
(7) into (9) and interchanging the order of summations results in 

y (nT) = £ x(rT)\h(nT -rT)- £ D k e iut(nT ~ rT) • (10) 

Defining 
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d(nT) = £ D tC '*"*- T J (11) 

we observe from (10) that the combined impulse response of the filter 
bank can be expressed as 

h(nT) = h(nT) d{nT). (12) 

Equations (11) and (12) are the basic results of the analysis. (Note 
that they could have been obtained directly by summing the impulse 
responses h k (nT), with the sacrifice of the interpretation of the filter 
bank outputs in terms of the short-time spectrum.) 

Equation (12) shows that the filter bank impulse response is the 
product of the prototype lowpass filter impulse response h(nT), and 
the sequence d{nT) defined by (11). The choice of lowpass filter depends 
on both the desired frequency resolution and the requirement of obtain- 
ing flat magnitude and linear phase response in the composite filter 
bank. The sequence d(nT) is independent of the prototype lowpass filter 
and is a function of the frequency spacing, the relative gains and phases, 
and the number of bandpass filters. Thus, for a given choice of prototype 
lowpass filter, the parameters of d(nT) can be adjusted to achieve the 
best approximation to the ideal spectrum analyzer. To see how this 
occurs, we shall first examine in detail the characteristics of the sequence 
d(nT). 

As will be shown in the remainder of this section, a particularly useful 
choice of the complex coefficients D k in (11) is 



(13) 



D k = e i * h "° T k = ±1, ±2, • • • , ±M. 
D u = 1, 
where n is an integer. That is, 

$ k = tbtfloT, and \ D k \ = 1. 

(The condition D = 1 implies that the band around co = is included 
in the filter bank; D = implies that it is not.) It can be shown that 
for this choice of D k , (11) becomes 

d(nT) = D + — . n [Au(nT + noT)/2] cos MnT + n T)] (14) 

where u„ = w, + (M ~ 1) Aw/2, and D n is 1 or depending on whether 
or not the lowpass channel is included. 

The properties of the sequence d(nT) determine the character of 
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the impulse response of the filter bank. Some of these properties are 
summarized below: 

(t) The parameter n shifts the sequence d(n T) by n samples with 

respect to h(nT). 
(ii) The sequence d{nT) is even about the sample n = —n ; i.e., 

d{nT - n T) = d(-nT - n T). 
{Hi) The maximum value of d{nT) occurs at n = —n . This value 

is d(-n T) = 2M + A> . 
(iv) If 2t/(AoiT) and wi/Aw are both integers, then the sequence 
d(nT) is periodic with period 2t/Ao>. Otherwise, d(nT) will be 
an almost periodic sequence which will peak up at time intervals 
of 2tt/ Aoi. 

Insight into the properties of d{nT) can be gained by considering a 
simple example. Assume that co, = Aoj and 2t/(AcoT) = N where N is 
an odd integer. That is, the entire frequency range —ir/T ^ w ^ ir/ T, 
is divided into N equal bands. If M = (N — l)/2, the entire frequency 
range is covered. Under these conditions (14) can be written 

sin ( 2M 9 + - ) MnT + n T) 
d( M = - sinW + n T)/2] ^ " « if ^ " ° (,r > il) 



sin 



J - if Do - 1. (15b) 



sin [Au,(nT + n r)/2] 

(If N is even, the fc = filter is not used, i.e., D = 0, and w, is Aco/2.) 
It is clear from (15b) that for these conditions d{nT) is a periodic 
sequence with period NT = 27r/Aa>. In fact, d(nT) may be thought of as 
samples of a continuous-time periodic Dirichlet kernel as shown in 
Fig. 2a. If M — (N — l)/2 and D Q = 1, d(nT) is a periodic discrete- 
time impulse train with impulses occurring at multiples of NT. This 
is because the sample points on the periodic Dirichlet kernel occur at 
the maxima and the zero crossings, as indicated by the small circles in 
Fig. 2a. 

The conditions for d(nT) to be periodic are that both o»i/Au and 
2ir/(Au}T) be equal to integers. To see this, we must examine (14) in 
detail. If 2tt/(AuT) is an integer, and M is an odd integer, the sequence 
2 sin [M Aa(nT + n T) /2]/sin [Aw(»T + n T)/2] is periodic with 
period NT = 2w/ Au. If oji/Aw is an integer and M is odd, the sequence 
cos [(cd[ + (M — 1) Ao)/2)(nT + n T)] is also periodic with a period 
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Fig. 2 — (a) Periodic continuous-time Dirichlet kernel, (b) continuous-time en- 
velope and sequence d(nT) when either wi/Aw or 2ir/(£aiT) are not integers. 



that is an integer multiple of 27r/Aco. Thus the product of these two 
sequences is periodic with period 2ir/Au. The identical result holds 
for M an even integer and 27r/Aco and u>i/A« integers although the 
interaction between the component sequences is slightly different. 

If either 2t/(AuT) or wi/Aco are not integers, d(nT) will not be 
periodic, but will still peak up at time intervals of 2jt/Aoj. Such a case 
is depicted in Fig. 2b where the samples d(i\T) are marked by the 
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small circles and the dotted curve shows the factor 

sin [M Ao,(nT + n T)/2]/sin [A<a(nT + n T)/2] 

when M is odd. As shown in Fig. 2b, d(nT) will always have even sym- 
metry about sample n = —n Q . 

III. DESIGN OF UNIFORM FILTER BANKS USING BESSEL FILTERS 

In the preceding section we presented a detailed analysis of a filter 
bank composed of equally spaced equal -bandwidth filters. In this 
section we will show how the results of that analysis can be employed 
in filter bank design. 

The objective of flat amplitude response and linear phase is most 
easily achieved with bandpass filters having these same properties. 
For this reason, Bessel (maximally flat delay) filters are often used in 
filter banks. 5 In the examples shown in this paper, we have used digital 
filters obtained from Bessel prototype designs using impulse invariance. 6 
It should be noted that the digital filters obtained this way do not have 
the maximally flat delay property. J. P. Thiran 7 has shown that the de- 
nominator of the system function of maximally flat delay digital filters 
is a Gauss hypergeometric function. It is reasonable to expect however, 
that for the narrow-band filters of interest here, the differences should 
be negligible. 

As an example a digital filter derived from a sixth-order Bessel 
lowpass filter with asymptotic cutoff frequency of 60 Hz is shown 
in Fig. 3. The impulse response is shown in Fig. 3a, and the amplitude 
and phase responses are shown in Fig. 3b and Fig. 3c. The filter shown in 
Fig. 3 was used in a filter bank* with the following choice of parameters: 
Do = 0, T = 10" 4 sec, Acu = 2tt(100), w x = 2tt(100), n = 0, and 
M = 30. The resulting filter bank characteristics are shown in Fig. 4. 
The filter bank impulse response, h(nT), is shown in Fig. 4a along 
with the prototype lowpass impulse response h(nT). For the above 
choice of parameters, d(nT) is obtained from (15a) as 

,. m . sin [0.6l7rn] t , 1ft .. 

sin [0.0l7m] 

which is periodic with period 100 samples (10 msec), with peaks occurring 
at nT = 0, ±10, ±20, • • • msec. From Fig. 4a, it can be seen that 
in the product h{nT)-d{nT), the peak of d(nT) at nT = will be 
attenuated since h(nT) is small around nT = 0. On the other hand, 



* Note that the resulting bandpass filters are twelfth order. 
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Fig. 3 — Sixth-order Bessel filter characteristics, (a) Impulse response, (b) mag- 
nitude response, (c) phase response. 
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Fig. 4 — Characteristics of 30-channel filter bank, (a) Impulse response (dotted 
curve is the impulse response of the prototype lowpass filter in Fig. 3), (b) composite 
magnitude response, (c) composite phase response after subtracting 10-msec delay. 
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the peak of d{nT) at nT = 10 msec occurs at approximately the peak 
of h(nT), and at nT 1 = 20 msec, h(nT) is large enough to produce a 
significant echo in the impulse response of the filter bank. As is shown 
in Fig. 4b and 4c, this corresponds to a 3.9-dB ripple in the amplitude 
response and a 25.5-degree peak-to-peak ripple in the phase response 
(after removing a linear phase component corresponding to a 10-msec 
or 100-sample delay). To decrease this amplitude and phase ripple, we 
should attempt to eliminate the echo in the impulse response. Further- 
more, the phase ripple will be eliminated if the impulse response h(nT) 
has even symmetry about some delay time n D T. One approach is to 
broaden the filter bandwidths, or equivalently reduce the spacing Aa>, 
so that h(nT) is contracted relative to the spacing of pulses in d(nT). 
This is generally not an acceptable solution since h(nT) and Ato are 
usually fixed by some frequency resolution criterion. However, if we 
refer to the properties of dinT) which were previously summarized, 
we note that a negative value of n will shift d(nT) to the right relative 
to h(nT) so that d(nT) will have even symmetry about time n D T = 
—n T. If n can be chosen so than h(nT) = h(nT)-d(nT) has approxi- 
mately even symmetry and consists of only one significant pulse, then 
the amplitude and phase ripple will be small. The manner in which this 
is achieved is shown in Fig. 5 where it is assumed for simplicity that 
d(nT) is a train of digital impulses as would be the case for M = (N — 
l)/2. Figure 5a depicts the case where n = 0. Figure 5b shows the 
situation where n„ was chosen to shift the impulse which was at n T = 
in Fig. 5a to the right and into the vicinity of the peak of h(nT). If it 
is assumed that only three impulses have nonzero amplitudes (a, , 
a 2 , a :i ) such that 4 | a, | • | a 3 | < | a, + a 3 | • | a 2 |, then it can be shown 
(see Appendix) that the peak-to-peak amplitude ripple of the filter 
bank is 



R A = 20 lo gl0 



. "2 



a 2 + Qfi + a 3 M Qy\ 

*i — «3 |J 



Similarly, if | a, + «.i I < I «»|j the peak-to-peak phase ripple about 
a linear phase corresponding to a delay of — n T is given by 



R P = 2 tan ' ^ — ~, — ?* x2 X i/ 2 ■ 

L(«2 — («1 + Ots) ) J 



(18) 



The conditions for (17) and (IS) to hold are satisfied when a, and a 3 
are small relative to a 2 , which is the normal situation. It can be seen 
from (18) and (17) that the phase ripple will be zero if a x = a 3 , and 
the amplitude ripple will be small if (a, + a 3 )/a 2 is small. 

Although these results were derived for the idealized case when d(nT) 
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Fig. 5 — Illustration of how to adjust the parameter n . (a) Composite impulse 
response for n = 0, (b) no chosen to minimize magnitude and phase ripple (dotted 
lines indicate movement of individual pulses in d(nT)). 



is an impulse train, we have found that amplitude and phase ripple 
can be determined quite accurately using (17) and (18) in more general 
situations. With the foregoing principles in mind we have written an 
interactive computer program for filter bank design. Using this program 
we can design a filter bank with low amplitude and phase ripple by 
the following process: 

(i) Choose co, , Aa>, and M to cover the desired analysis band and 

choose h(nT) to provide desired frequency resolution. This 

results in an h(nT) that has a duration of approximately 4tt/Aw 

as shown in Fig. 5. 

(n) Evaluate h(nT) and determine n such that a, « a 3 as in 

Fig. 5b. 
(Hi) If the resulting filter bank is not satisfactory, steps i and ii 
are repeated. 

In cases where wj/Aw is not an integer, it is important to choose n n 
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so that the point of even symmetry in d(nT) is shifted into the vicinity 
of the peak of h(nT). Otherwise, it may be impossible to achieve a very 
good approximation to linear phase. An example of the improvement 
gained by proper choice of n is shown in Fig. 6. In this example all 
the parameters were the same as in the example of Fig. 4 except a value 
of n — — 129 was chosen by the above process. In this case the in-band 
amplitude ripple is 0.8 dB and the phase ripple is 0.6 degree, as compared 
to 3.8 dB and 25.5 degrees when ??„ = 0. 

R. M. Golden 5 has shown that inverting the sign of alternating chan- 
nels often significantly improves the characteristics of a filter bank. This 
technique has a simple interpretation in terms of our results. It can be 
shown that inverting the sign of alternating channels is equivalent to 
delaying the sequence d(nT) by n = —ir/(AuT) samples. This amount 
of delay may be nearly correct if the duration of h(nT) is approxi- 
mately 3ir/Aco; however for the situation shown in Fig. 4a, such a delay 
would produce a worse filter bank than no delay at all (n = 0). Also, 
to achieve linear phase when oji/Aoj is not an integer, the point of even 
symmetry in d(nT) should be delayed to the vicinity of the peak of 
h(nT). This does not occur when the signs of alternate channels are 
inverted. 

IV. DESIGN OF NONUNIFORM BANDWIDTH FILTER BANKS 

In speech applications it is common to take advantage of the fre- 
quency resolution characteristics of the ear 4,5 by using increasing 
bandwidth filters at higher frequencies. The previously discussed tech- 
niques can be applied to this situation if the filter bank consists of 
several sub-banks, each with different resolution. Each sub-bank can 
be designed as discussed above, with care being taken to ensure that 
the entire frequency band of interest is covered by the combination 
of the sub-banks. It may be necessary to equalize the delay between 
sub-banks by providing additional delay for all but one of the sub- 
banks.* This is depicted in Fig. 7 for three sub-banks with increasing- 
bandwidth sixth-order Bessel filters. Figure 7a shows the lowpass 
prototype impulse response and shifted d(nT) sequence t for the first 
sub-bank. The lowpass asymptotic cutoff used was 78 Hz, the spacing 
of filters was Aaii = 2ir(125), the first filter was centered at u n = 
2r(250), and a value of n ltl = —100 (10-msec delay) was required to 

* Golden 5 has shown that the delays can be approximately equalized by increasing 
the order of the lowpass prototype in direct proportion to the increase in bandwidth. 

f The sequence cl(nT) is shown as an impulse train for convenience in plotting. 
The actual sequences would look like those in Fig. 4 and Fig. 6. 
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Fig. 6 — Characteristics of 30-channel filter bank, (a) Impulse response for n = 
— 129 (dotted curve is the impulse response in Fig. 3), (b) composite magnitude 
response, (c) composite phase response after subtracting 12.9-msec delay. 
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minimize the amplitude and phase ripple. Figure 7b shows the second 
sub-bank in which the basic parameters were: lowpass asymptotic 
cutoff 136 Hz, Aco 2 = 2tt(21S), w 12 = 2tt(1296.5), and n 02 = -57 
(5. 7- msec delay). To line up the central peaks, an additional delay 
of n 2 = 43 samples (4.3 msec) was required. Figure 7c shows the third 
sub-bank where the lowpass cutoff was 192 Hz, Aw 3 = 27r(307), w 13 = 
2ir(2213), and n 0:) = — 40 (4-msec delay). A value of n 3 = 60 samples 
(6.0 msec) is required to line up the central peak with those in Fig. 7a 
and 7b. The response of the combination of these three sub-banks is 
shown in Fig. 8. Figure 8a shows the impulse response, Fig. 8b shows 
the amplitude response, and Fig. Sc shows the phase after a linear 
phase corresponding to 10-msec delay has been subtracted. It can be 









\ (a) 


< r>oi T / 




> 




jy 


2ir 

Ad), 


* 


^>n 1 . 



20 nT(msec) 




20 nT (msec) 



20 nT(msec) 



Fig. 7 — Illustration of the design of nonuniform filter banks: (a) impulse response 
for narrow bandwidth filters, fb) impulse response for intermediate bandwidth filters, 
(c) impulse response for wide bandwidth filters. 
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Fig. 8 — Composite filter bank characteristics with three different sub-banks : (a) 
impulse response, (b) magnitude response, (c) phase response after subtracting 
10-msec delay. 
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seen in Fig. 8b and 8c that the ripple in the sub-banks is quite low 
as would be expected from Fig. 7. At the boundary between sub-banks, 
however, the ripple increases significantly due to the fact that the last 
filter in the lower sub-bank drops off more rapidly than the first filter in 
the next sub-bank. This excessive variation at the boundary between 
sub-banks can be eliminated to some extent by using increasingly 
higher-order filters in the sub-banks. Alternatively, nonuniform resolu- 
tion can be obtained by using equal-bandwidth filters and adding 
together groups of two or more of their outputs to achieve the desired 
bandwidth. Such an approach would require increased computation 
but would produce filter bank characteristics comparable to those in 
Fig. 6. 

V. CONCLUSION 

We have discussed the analysis and design of digital filter banks 
and have shown how the incorporation of a linearly increasing phase 
shift in each bandpass filter can significantly improve the overall filter 
bank characteristics. We also showed how the techniques can be used in 
nonuniform bandwidth filter banks. 

The examples which we gave were based on Bessel lowpass prototypes 
which have impulse responses of desirable shape but rather poor am- 
plitude response. Recent results in the design of finite duration impulse 
response filters 8 offer attractive possibilities for filter bank design. 
Such filters can have precisely linear phase and can be designed using 
iterative techniques with constraints on both the impulse response 
shape and the amplitude response. The use of such filters, together with 
the basic principles discussed in this paper, should yield filter banks 
with excellent properties. 

APPENDIX 

Derivation o] Magnitude and Phase Ripple Formulas 
Assume an impulse response sequence 

h(n) — a, n = 
= a 2 n = n v 
= a 3 n = 2n p 

= elsewhere. (19) 

The system function of this system is 

H(e i " T ) = a, + a 2 <r""" r + a*-"**'. (20) 
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The squared magnitude response is 

I H(e iaT ) | 2 = [a, + (a, + a,) cos («n,T)] 2 + (a, - a 3 ) 2 sin 2 («n,D, 

(21) 

and the phase response is 

yrr, ia,T\-> -l («i — ct 3 ) sin (coripT) . . 

arg [#(e' )] = tan — V- L 7 , > — * ' (22) 

L« 2 + («i + « 3 ) cos (con p T)J 

where a linear phase component — un p T has been removed. Clearly, 
both (21) and (22) are periodic functions of co with period 2tt /n„T. 
To determine the amplitude and phase ripple, we must locate the 
maxima and minima of (21) and (22). 

If we differentiate (21) with respect to co, we find that the maxima 
and minima occur for values of co satisfying 

sin ipm p T) = (23a) 

cos (om p T) = -<x 2 (g ^" g,) - (23b) 

The second equation is satisfied by a real value of co if and only if 

4 | <xi |'| Og | > I «i + a 2 |'| a 2 |. (24) 

In a good filter bank design, a, and a 3 will be positive and much smaller 
than a 2 , and (24) will not be satisfied. Evaluating the second derivative 
shows that in this case the maxima and minima of | H(e' uT ) \ will 
alternate and occur at values of co satisfying (23a); i.e., co = 0, ±ir/n p T, 
±2ir/n p T, • • • .In this case the amplitude ripple in dB is given by 



p on 1™ «2 + «1 + «3 

H A = 20 log 10 i r • 

L a 2 - «i - «3 J 



(25) 



If (22) is differentiated with respect to co, we find that the maxima 
and minima occur at values of co satisfying 

cosco<T= -(^ J ^ 1 )- (26) 

Equation (26) is satisfied by real values of a if | a x + a 3 | < | a 2 |. 
In this case the maxima and minima again alternate, and the peak- 
to-peak phase ripple is 

R p = 2 tan" 1 \ , tt ' " " 3 2 1/2 1 • (27) 

_(a 2 — (a, + a 3 ) ) J 



DIGITAL FILTER BANKS 3115 

If | a, + a 3 | > | a 2 I, the phase curve will be discontinuous with a 
jump of 27r radians occurring at u = ±ir/n v T, ±3ir/n P T, • • • . 
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